DETAILED ACTION

This communication is responsive to the application filed 9/30/03. Claims 1-21 are 
pending in the application. Claims 1 and 20-21 are independent claims. 

Claim Rejections - 35 USC § 102 
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351 (a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

Claims 1-7, 9-15, 17-18, 20-21 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Wolton et al. (US 2004/0030741). 

As per claims 1 and 20-21, Wolton et al. teach 

identifying a compound document as a coherent body of hyperlinked material on a 
single topic as created by a number of collaborating authors - paragraphs 152, 363- 
367, 832. 

analyzing the content and structure of the compound document to find a preferred entry 
point for the compound document - pars. 662-663, 800. 

processing the compound document as a whole, including at least one of indexing, 
classification, and retrieval - pars. 432, 512-521, 474. 

processing the compound document from the entry point, including at least one of 
creating at least one of presentation of results from retrieval, summarization, and 
classification - pars. 49-52, 151-154, 831. 

As per claim 2, Wolton et al. teach the internet, an intranet, and a digital library - par. 
149. 

As per claim 3, Wolton et al. teach 

wherein the body of hyperlinked material is distributed over a plurality of URLs - pars. 
156, 158, 802, 832. 

As per claim 4, Wolton et al. teach 

wherein the identifying includes observing the results of a number of heuristics run on 
the body of hyperlinked material and related hyperlinks - pars. 397 (rules), 402, 792- 
795. 

As per claim 5, Wolton et al. teach 

wherein the heuristic includes identifying hyperlinks that link within the same directory 
and include a sufficient quantity of common anchor text - pars. 566-573. 

As per claim 6, Wolton et al. teach 

wherein the heuristic includes identifying hyperlinks that contain linguistic structures that 
indicate relationships between parts of a document including at least one of a list of 
page numbers, and the terms "next", "previous", "index", "contents", and their 
non-English equivalents - pars. 433, 512-521, 1045. 

As per claim 7, Wolton et al. teach 

wherein the heuristic includes identifying external hyperlinks to the same places - pars. 
538, 567. 

As per claim 9, Wolton et al. teach 

wherein the heuristic includes identifying individual URLs having similar structure 
indicating an order of inclusion in the compound document - pars. 163-164, 484. 

As per claim 10, Wolton et al. teach 

wherein the heuristic includes identifying a link structure of "wheel" form - pars. 426, 
544-550, 564-567. 

As per claim 11, Wolton et al. teach 

wherein the analyzing includes observing the results of a number of heuristics run on 
the component document and related hyperlinks - pars. 374, 432, 802, 1048. 

As per claim 12, Wolton et al. teach 

wherein the heuristic includes identifying specific filenames that define the entry point, 
including at least one of: "index" and "default" - pars. 432, 662, 800. 
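
For illustration only, the filename heuristic recited in claim 12 can be sketched as below. This is a minimal sketch and is not taken from the application or from Wolton et al.; the function name has_entry_point_filename and the choice to ignore file extensions and letter case are assumptions made for the example.

```python
import posixpath
from urllib.parse import urlsplit

# Filename stems named in claim 12 as defining an entry point.
ENTRY_POINT_STEMS = {"index", "default"}


def has_entry_point_filename(url):
    """Return True if the last path segment of url is named "index" or "default".

    Assumption for this sketch: the extension and letter case are ignored,
    so "index.html" and "Default.aspx" both qualify.
    """
    path = urlsplit(url).path
    filename = posixpath.basename(path)        # "" when the path ends in "/"
    stem, _ext = posixpath.splitext(filename)  # "index.html" -> ("index", ".html")
    return stem.lower() in ENTRY_POINT_STEMS
```

Under these assumptions, a URL ending in /docs/index.html is flagged as a candidate entry point, while /docs/chapter2.html is not.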

As per claims 13-14, Wolton et al. teach 

wherein the heuristic includes identifying a particular component document in the 
compound document as the entry point because the component document has several 
in-links; wherein the in-links are from outside the compound document - pars. 18, 156, 
434. 

As per claim 15, Wolton et al. teach 

wherein the heuristic includes identifying a particular component document in the 
compound document as the entry point because the component document has several 
out-links - pars. 538, 567. 

As per claim 17, Wolton et al. teach 

URLs having common directory components followed by different ending directory 
components - pars. 565-571. 
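
For illustration only, the URL structure recited in claim 17 (common directory components followed by different ending directory components) can be sketched as below. This is a minimal sketch and is not taken from the application or from Wolton et al.; the function name split_common_directories and the example URLs in the note after it are hypothetical.

```python
import posixpath
from urllib.parse import urlsplit


def split_common_directories(urls):
    """Split the directory part of each URL into a shared prefix and an ending.

    Returns (common, endings), where common is the longest run of leading
    directory components shared by every URL and endings[i] is what remains
    of the i-th URL's directory ("." if nothing remains).
    """
    # Directory part of each URL path; an empty path is treated as the root.
    dirs = [posixpath.dirname(urlsplit(u).path) or "/" for u in urls]
    common = posixpath.commonpath(dirs)
    endings = [posixpath.relpath(d, common) for d in dirs]
    return common, endings
```

For the hypothetical URLs http://example.com/docs/manual/ch1/page.html and http://example.com/docs/manual/ch2/page.html, the shared prefix is /docs/manual and the ending directory components are ch1 and ch2.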

As per claim 18, Wolton et al. teach 

wherein the ending directory components contain specific identifying information - pars. 
662, 800. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



Application/Control Number: 10/676,918 Page 7 

Art Unit: 2163 

Claim 8 is rejected under 35 U.S.C. 103(a) as being unpatentable over Wolton et 
al. (US 2004/0030741), in view of Brown et al. (US 2004/0064471). 

As per claim 8, Wolton teaches similarity between words/terms, subjects - pars. 164, 
435. Wolton does not disclose wherein the heuristic includes identifying at least one of: 
similar creation dates and similar last-modified dates. Brown teaches a page has a 
plurality of links to linked pages in the database - pars. 10-11; 45, 47; web pages' 
information such as creation dates can be searched - par. 62. Thus, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to combine 
Wolton's teaching with Brown's teaching in order to better identify the searching pages. 

Claims 16 and 19 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Wolton et al. (US 2004/0030741), in view of Gould et al. (US 2005/0060295). 

As per claim 16, Wolton does not disclose determining a measure of vector distances 
along intra-document links between a particular component document and all other 
component documents in the compound document. Gould et al. teach classifying data 
using distance metric between feature vectors where nodes of data are connected by 
links - pars. 24-25. Thus, it would have been obvious to one of ordinary skill in the art 
at the time of the invention to combine Wolton's teaching with Gould's teaching in order 
to analyze data for better storage and retrieval. 

As per claim 19, Wolton et al. do not teach numerical scores, wherein the combining 
includes a weighted averaging of the numerical scores into an overall score, and the 
maximum overall score determines the preferred entry point. Gould teaches overall 
score - pars. 54, 60, 65; weight - pars. 56, 59-60, 64. Thus, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to combine Wolton's 
teaching with Gould's teaching in order to better analyze the data and thus provide 
better data storage and retrieval. 
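
For illustration only, the claim 19 limitation (a weighted averaging of per-heuristic numerical scores into an overall score, with the maximum overall score determining the preferred entry point) can be sketched as below. This is a minimal sketch and is not taken from the application, Wolton et al., or Gould et al.; the function names, heuristic names, weights, and scores are all hypothetical.

```python
def weighted_average(scores, weights):
    """Combine per-heuristic scores into one overall score.

    scores and weights are dicts keyed by heuristic name; only heuristics
    present in scores contribute to the average.
    """
    total_weight = sum(weights[name] for name in scores)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(scores[name] * weights[name] for name in scores) / total_weight


def preferred_entry_point(candidates, weights):
    """Return (best_url, overall_scores) for a non-empty dict of candidates.

    candidates maps each component document URL to its per-heuristic scores.
    The URL with the maximum overall score is returned as the entry point.
    """
    overall = {url: weighted_average(s, weights) for url, s in candidates.items()}
    best = max(overall, key=overall.get)
    return best, overall
```

With hypothetical weights of 2.0 for a filename heuristic and 1.0 for an in-link heuristic, a page scoring 1.0 and 0.5 on those heuristics has an overall score of 2.5/3 and is preferred over a page scoring 0.0 and 1.0, whose overall score is 1/3.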



Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to LINH BLACK whose telephone number is 
571-272-4106. The examiner can normally be reached on Mon.-Thurs. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Don Wong, can be reached on 571-272-1834. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

LINH BLACK 
Examiner 
Art Unit 2163 

March 21, 2008. 
/don wong/ 

Supervisory Patent Examiner, Art Unit 2163 



